AD-8183  378 


UNCLASSIFIED 


CLASSIFICATION  OF  SELECTED  RADAR  IHAGERV  PATTERNS  USING 
A  BINARV  TREE  CLASSIFER(U)  ARNV  ENGINEER  TOPOGRAPHIC 
LABS  FORT  BELVOIR  VA  N  D  FOX  OCT  BC  ETL-B442 


AD- A 183  370 


ETL-0442 


8IK  File  c 


Classification  of  selected 
radar  imagery  patterns 
using  a  binary  tree 
Classifier 


Neil  D.  Fox 


D71C 

-LECTEi 

AUG  0  6  19871 


October  1986 


APPROVED  FOR  PUBLIC  RELEASE;  DISTRIBUTION  IS  UNLIMITED 


Prepared  for 

U.S.  ARMY  CORP  OF  ENGINEERS 
ENGINEER  TOPOGRAPHIC  LABORATORIES 
FORT  BELVOIR,  VIRGINIA  22060-5546 


87  8  5  oil! 


UNCLASSIFIED 


SECURITY  CLASSIFICATION  OF  THIS  PACE  |TWi«l  Dcre  entered) 


REPORT  DOCUMENTATION  PAGE 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


1.  REPORT  NUMBER 

ETL-0442 


2.  GOVT  ACCESSION  NO 


RECIPIENT'S  CATALOG  NUMBER 


4.  TITLE  fmcf  Subtitle) 

CLASSIFICATION  OF  SELECTED  RADAR  IMAGERY  PATTERNS 
USING  A  BINARY  TREE  CLASSIFIER 


s  type  of  report  a  period  covered 

Research  Note 

■Dae. 83. t  Sep  84 


S.  PERFORMING  ORG.  REPORT  NUMBER 


».  AUTHORS 

Neil  D.  Fox 


S.  CONTRACT  OR  GRANT  NUMBER!*; 


».  PERFORMING  ORGANIZATION  NAME  AND  AOORE9S 

U.S.  Army  Engineer  Topographic  Laboratories 
Fort  Belvoir,  Virginia  22060-5546 


10.  PROGRAM  ELEMENT,  PROJECT.  TASK 
AREA  A  WORK  UNIT  NUMBERS 


It.  CONTROLLING  OFFICE  NAME  AND  ADORESS 

U.S.  Army  Engineer  Topographic  Laboratories 
Fort  Belvoir,  Virginia  22060-5546 


12.  REPORT  DATE 

October  1986 


IS.  NUMBER  OF  PAGES 

31 


U.  MONITORING  AGENCY  NAME  •  AOORESy/f  different  /ram  Controlling  Olllce) 


IS  SECURITY  CLASS,  (ol  I  hie  report) 

Unclassified 


IS*.  OECLASSI  FI  CATION/ DOWNGRADING 
SCHEDULE 


IS.  DISTRIBUTION  STATEMENT  fo I  thle  Report) 


IT.  DISTRIBUTION  STATEMENT  (el  tire  MM  tract  an  I  are  it  |n  Stack  29.  II  dlllerenl  treat  Report; 


»•  SUPPLEMENTARY  MOTES 


It  KEY  WOROS  I  Com  Area  an  reaaraa  etde  If  nacaaawrr  «n  d  Identify  by  block  number ) 

Binary  Decision  Tree 
Bayes  Classifier 
Radar  Imagery 
Feature  Selet'tlon 


H  sartACT  iOf  <  me  latara*  a dm  It  *c*a eeeyy  tmd  Identity  by  block  number) 

This  report  tails  the  results  of  classifying  radar  imagery  using  a  binary 
tree  classifier.  It  was  found  that  this  classification  algorithm  works  well 
with  radar  Imagery,  which  would  indicate  a  normal  (Gaussian)  feature  vector 
distribution.  The  number  of  elements  In  each  feature  vector  is  the  limiting 
factor.  <  la  situation  time  Is  negligible  once  the  tree  structure  has  been 
created . 


UNCLASSIFIED 

security  CL ASSfFICATIOI*  of  TniS  Pl  .t 


Dmf  fnli 


t  * 


un  EDIT  ion  OF  t  MOW  II  t»  OBSOLETE 


PREFACE 


This  work  reported  on  was  done  under  DA  Project  4AI61 102B52C,Task  B,  Work  Unit 
0015,  “Automated  Radar  Feature  Extraction." 

The  work  was  performed  during  the  period  December  1983  to  September  1984  under 
the  supervision  of  Dr.  Frederick  W.  Rohde,  Team  Leader,  Center  for  Physical  Sciences,  and 
Dr.  Robert  D.  Leighty,  Director,  Research  Institute. 

COL  Alan  L.  Laubscher,  CE,  was  Commander  and  Director,  and  Mr.  Walter  E.  Boge 
was  Technical  Director  of  the  U.S.  Army  Engineer  Topographic  Laboratories  during  the 
report  preparation. 


Accesion  For  | 

NTIS 

cra&i  yj 

OTIC 

tab  □ 

Unannounced  □ 

Justification 

By _ 

Dist  iO 

.tin-.  / 

Av.. 'cb.rty  0<d*s 

Dist 

Av.J  ,• 

/H 

! 

COPY 

' NS Pf  CTED 

V  6  J 


rCi 


i 

is%i 

it 


V  *. 


CONTENTS 


PREFACE 

ILLUSTRATIONS 

INTRODUCTION 

DESIGN 

IMAGERY  USED  FOR  TEST 
RESULT 
CONCLUSIONS 
REFERENCES 

APPENDIX  A.  Data  for  Creation  of  Decision  Tree 
APPENDIX  B.  Software  Listing 


ILLUSTRATIONS 


FIGURE 

1 

2 

3 

4 

5 

6 


TITLE 

Creating  the  Binary  Decision  Tree 

Flowchart  of  Classifier 

Decision  Tree  for  Image  samples  from 
Huntsville,  AL 

Decision  Tree  for  Image  Samples  from 
Elizabeth  City,  NC 

Classified  Results  for  Image  Samples 
from  Huntsville,  AL 

Classified  Results  for  Image  Samples  from 
Elizabeth  City,  NC 


PAGE 

"3 

3 

6 

7 

8 
9 


1U 


CLASSIFICATION  OF  SELECTED  RADAR  IMAGERY 
PATTERNS  USING  A  BINARY  TREE  CLASSIFIER 


INTRODUCTION 

By  using  hierarchical  clustering,  it  is  possible  to  improve  the  classification  results  of  a 
single-stage  classifier.  The  inefficiency  of  the  single-stage  classifier  is  in  part  due  to  the 
simultaneous  use  of  all  feature  vector  components.  By  using  a  decision  tree,  only  the  features 
best  suited  to  separate  the  classes  at  a  node  are  used. 

A  decision  tree  is  a  set  of  nodes  that  represent  a  set  of  feature  vectors  in  the  training  set. 
The  root  node  contains  all  the  vectors  in  the  set.  This  root  node  is  then  clustered  into  two 
new  nodes  (called  “sons"),  and  hence,  two  new  “subclasses”  are  formed.  The  subclasses  are 
further  clustered  until  only  terminal  nodes  (those  with  a  dominant  class)  remain. 

An  unknown  sample  enters  the  tree  at  the  root  node.  A  decision  rule  is  used  to  classify 
the  feature  vector  downward  in  the  tree  to  either  the  left  or  right  son.  This  process  of  sending 
the  sample  down  the  tree  is  repeated  until  it  reaches  a  terminal  node.  It  then  is  classified  to  be 
the  same  class  as  the  dominant  class  of  that  node. 

Feature  selection  is  done  at  each  node.  Ideally,  all  combinations  of  feature  vector 
components  should  be  tested,  but  this  becomes  impractical  owing  to  time  constraints  when 
the  number  of  feature  vector  components  become  large.  Fortunately,  the  number  of  feature 
vector  components  necessary  to  classify  radar  imagery  can  be  small. 

Since  an  assumption  of  hierarchical  classes  is  made,  a  Bayes  classifier  was  chosen  to 
take  advantage  of  the  normal  distribution  one  expects  in  such  a  system.  Also,  only  the  mean 
vector  and  covariance  matrix  need  be  computed  for  each  node.1 

DESIGN 

The  feature  vector  components  chosen  for  the  tree  are 

1 .  Covariance. 

2.  Skewness. 

3.  The  number  of  lines  detected  by  a  Hough  transform. 

4.  The  average  pixel  value. 

5.  The  number  of  pixels  over  a  threshold  value. 

The  clustering  was  done  with  a  K-means  algorithm,  (k=2). 

The  decision  rule2  is 

Xewjiffd,  (X)>d2(X).  i=|,2  and  j  =  2. 1 
dj(X)=  Inpia,-)-'/:  in |  |  - '/*  (X-tfj)  T  V  <X-«  j> 

Where  wj  is  class  i, 

X  is  the  unknown  sample, 

is  the  covariance  matrix  of  the  i1*1  class,  and 
Mj  is  the  mean  vector  of  the  i1*1  class. 
p(o»j)  is  the  a  priori  probability  of  X  toij. 

'  Jack  K  Mui  and  King-Sun  f*u.  “  Automated  Classification  of  Nucleated  Blood  C  ells  l  sing  a  Binars  I  ree  Classifier."  I  f  f  V 
Transactions  on  Pattern  Analysis  and  Machine  Intelligence.  Vol  PAMI-2.  No  5.  Sep.  pp  429-442.  I9H0 

J  t  tou  and  R  t  Gon/ales.  "Pattern  Recognition  Principles".  Addison- Wesles  Publishing  Compans.  Reading. 

Massachusetts.  1974 
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FIGURE  1.  Creating  the  Binary  Decision  tree. 


INPUT  VECTOR  (unknown  classification) 


FIGURE  2.  Flowchart  of  classifier. 


The  program  was  automated  to  execute  all  feature  selection.  The  selection  was  done  on 
the  basis  of  how  well  classes  were  separated  after  clustering.  The  formula  used  is 


k  k 

ITT  *,  ‘n'^VlTJ  S,  JnloeJn 
n=l  n=l 


where 

I  =  number  of  vectors  at  left  son  from  category  n 
Jn  =  number  of  vectors  at  right  son  from  category  n 
1  =  number  of  vectors  at  left  son 
J  =  number  of  vectors  at  right  son 
k  =  number  of  categories  in  training  set 

A  good  separation  is  indicated  by  a  large  number. 

The  four  categories  of  samples  used  for  testing  the  tree  classifier  are 

1 .  forest. 

2.  field. 

3.  water. 

4.  city. 

The  algorithms  used  in  creating  the  tree  and  the  flowchart  of  the  tree  classifier  are 
shown  in  figures  1  and  2,  respectively. 


IMAGERY  USED  FOR  TEST 


The  proposed  binary  tree  classifier  was  applied  to  two  selected  sets  of  high  resolution 
synthetic  aperture  radar  imagery  taken  over  the  Huntsville,  Alabama,  and  the  Elizabeth  City, 
North  Carolina,  areas  with  the  APD-10  and  the  UPD-4  radar  systems,  respectively.  Sections 
of  the  radar  imagery  were  digitized  and  stored  on  a  disk  unit.  A  Lexidata  System  3400  display 
processor  was  used  to  display  the  images  on  a  cathode  ray  tube.  Software  was  written  to  take 
100  samples  for  each  of  four  terrain  classes  from  imagery  displayed  on  the  screen.  Each  image 
sample  consisted  of  32  by  32  pixels  that  were  located  within  a  section  of  one  particular  terrain 
class.  The  four  classes  considered  were  ( 1 )  cities  (combination  of  commercial  and  residential 
structures,  DLMS  category  #504  FIC  301  and  #505  FIC  401),  (2)  fields  (agriculture  used 
primarily  for  crop  and  pasture  land,  DLMS  category  #510  FIC  950),  (3)  water  (rivers  with 
smooth  fresh  water,  DLMS  category  #510  FIC  940  and  fresh  water  subject  to  ice,  lakes  and 
reservoirs,  DLMS  category  #510  FIC  943),  and  (4)  forests  (mixed  trees,  deciduous  and 
evergreens,  DLMS  category  #5 10  FIC  954).  A  feature  vector  consisting  of  1 5  components  was 
computed  for  each  sample.  These  1 5  components  were  made  up  of  the  first-  and  second-order 
gray  level  histogram  statistics  computed  from  each  sample.  These  components  were  then  used 
as  the  input  for  the  binary  tree  classifier. 


RESULTS 


Two  sets  of  radar  imagery  (Huntsville,  Alabama  and  Elizabeth  City,  North  Carolina) 
were  used  to  create  two  binary  decision  trees.  The  resulting  trees  are  shown  in  figures  3  and 
4,  and  the  results  of  classifying  the  training  sets  are  shown  in  figures  3  and  6. 


Often,  only  one  or  two  feature  vector  components  were  used  at  a  node,  which  illustrates 
the  advantage  of  using  a  hierarchical  approach  rather  than  the  traditional  single-stage 
classifier.  For  example,  although  thresholding  was  an  excellent  feature  vector  component 
when  separating  water  from  field,  forest,  and  city,  it  was  a  poor  feature  with  respect  to  the 
separation  of  city  and  forest. 


The  overall  classification  accuracy  for  the  Huntsville  image  samples  was  99.25  percent 
and  for  the  Elizabeth  City  samples,  98.30  percent. 


CONCLUSIONS 


1 .  A  binary  tree  classifier  can  be  used  to  classify  a  selected  set  of  radar  images  with 
high  accuracy  as  illustrated.  However,  a  new  tree  hierarchy  will  be  needed  if  a  new  set  of 
images  from  a  different  geographical  area  is  to  be  classified  optimally  with  a  reasonably  high 
accuracy. 


accuracy. 

2.  Only  the  most  pertinent  feature  vector  components  should  be  used  at  each  node. 
This  reduces  processing  time  and  improves  classification  accuracy. 


3.  An  automated  classifier  is  practical  if  a  small  set  of  feature  vector  components  is 
used.  For  vectors  with  a  large  number  of  features,  some  human  interaction  in  feature 
selection  may  be  necessary  to  avoid  lengthy  processing  time.  The  create-tree  program  took 
approximately  1 1  minutes  to  finish  a  tree  using  five  features. 


4.  This  work  represents  an  application  of  a  recently  developed  statistical  classification 
method.  A  possible  limitation  is  the  requirement  to  create  a  new  tree  hierarchy  for  training, 
whenever  a  new  set  of  images  is  to  be  classified  with  a  reasonably  high  accuracy. 
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Appendix  A.  Date  for  Creation  of  Decision  Tree 


Node# 

Huntsville,  Alabama 

Features  Used  in  Separation  of  Node 

1 

THRSH,  HOU,  COV 

2 

THRSH,  HOU 

3 

HOU 

4 

AVV,  THRSH,  HOU 

5 

AVV,  SKW,  THRSH 

6 

SKW 

9 

SKW 

10 

SKW 

11 

HOU 

12 

THRSH 

17 

HOU 

22 

THRSH,  HOU 

24 

AVV 

27 

AVV 

Key:  HOU 

—  Hough  Transform 

SKW 

—  Skewness 

COV 

—  Covariance 

THRSH 

—  Threshold 

AVV 

—  Average  Pixel  Vali 

Appendix  A.  (Coat’d) 
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Node# 

1 

2 

4 

5 

6 

7 

8 

9 

10 
11 
13 
15 

19 

20 
21 
24 

27 

28 
31 


Elizabeth  City,  North  Carolina 

Features  Used  in  Separation  of  Node 

SKW 

THRSH 

HOU 

AVV,  THRSH 
AVV,  COV 
AVV,  HOU 
AVV 

SKW,  HOU 

AVV,  SKW 

SKW,  HOU,  COV 

THRSH 

AVV 

AVV 

COV 

SKW 

SKW 

SKW 

AVV,  THRSH 
AVV,  SKW 
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Elisabeth  City,  North  Carolina 


NODE# 

WATER 

FOREST 

CITY 

FIELD 

1 

100 

100 

100 

100 

2 

0 

100 

100 

100 

3 

100 

0 

0 

0 

4 

0 

12 

0 

100 

5 

0 

88 

100 

0 

6 

0 

63 

2 

0 

7 

0 

25 

98 

0 

8 

0 

19 

24 

0 

9 

0 

6 

74 

0 

10 

0 

9 

21 

0 

11 

0 

10 

3 

0 

12 

0 

5 

0 

0 

13 

0 

7 

0 

100 

14 

0 

1 

17 

0 

15 

0 

8 

4 

0 

16 

0 

1 

0 

100 

17 

0 

6 

0 

0 

18 

0 

0 

67 

0 

19 

0 

6 

7 

0 

20 

0 

2 

2 

0 

21 

0 

4 

5 

0 

22 

0 

8 

1 

0 

23 

0 

0 

3 

0 

24 

0 

2 

5 

0 

25 

0 

2 

0 

0 

26 

0 

8 

0 

0 

27 

0 

2 

3 

0 

28 

0 

45 

2 

0 

29 

0 

18 

0 

0 

30 

0 

17 

0 

0 

31 

0 

28 

2 

0 

32 

0 

27 

0 

0 

33 

0 

1 

2 

0 

34 

0 

0 

5 

0 

35 

0 

2 

0 

0 

36 

0 

2 

0 

0 

37 

0 

0 

3 

0 

38 

0 

0 

2 

0 

39 

0 

2 

0 

0 

Vector  Distribution  at  Each  Node  of  Decision  Tree 


12 


Appendix  A.  (Con't) 


Huntsville,  Alabama 


NODE# 

WATER 

FOREST 

CITY 

FIELD 

1 

100 

100 

100 

100 

2 

5 

100 

100 

97 

3 

95 

0 

0 

3 

4 

5 

10 

88 

95 

5 

0 

90 

12 

2 

6 

0 

10 

88 

95 

7 

5 

0 

0 

0 

8 

0 

0 

0 

95 

9 

0 

10 

88 

0 

10 

0 

7 

12 

0 

11 

0 

83 

0 

2 

12 

0 

10 

25 

0 

13 

0 

0 

63 

0 

14 

0 

10 

0 

0 

15 

0 

0 

25 

0 

16 

0 

4 

0 

0 

17 

0 

3 

12 

0 

18 

0 

0 

0 

2 

19 

95 

0 

0 

1 

20 

0 

3 

0 

0 

21 

0 

0 

12 

0 

22 

0 

62 

0 

2 

23 

0 

21 

0 

0 

24 

0 

13 

0 

2 

25 

0 

49 

0 

0 

26 

0 

2 

0 

0 

27 

0 

11 

0 

2 

28 

0 

10 

0 

0 

29 

0 

1 

0 

2 

Vector  Distribution  at  Each  Node  of  Decision  Tree 
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iTREC3  T-00Q04  IS  ON  CR00024  USING  00030  BLKS  R-0000 


FTN4X 


0003 
0004 
000S 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
001S 
0016 
0017 
0018 
0019 
0020 
0021 
0022 
0023 
0024 
0023 
0026 
0'  t 
0«>-8 
0029 
0030 
0031 
0032 
0033 
0034 
0033 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 
0044 
0043 
0046 
0047 
0048 
0  049 
0030 
0051 
OP""* 
Ol  J 
0054 
0  055 
0056 
0037 
0038 


PROGRAM  TREC3 

THIS  PROGRAM  READS  DATA  OFF  THE  DISK  WHICH  WAS  CREATED  BY 
PROGRAM  "TREEF".  THIS  DATA  IS  USED  TO  IMPLEMENT  A  DECISION 
TREE. 

DIMENSION  C0V(5.5 .61  >  .MEAN<5.6!>  .IDCBU44)  ,ITREE(61  .2) 
DIMENSION  XIN< 400 .3) .IDCB4(144) ,LABS(4)  .INP2(400> 

DIMENSION  LUOT (5) .LL<4.61 ) . LABL ( 4 ) .WV(5.61 > ,XS(5) 

DIMENSION  INAME(3> .X(3) .INDEX (61) 

DIMENSION  NODE (61 > .MEAN1 (3) .COV1 (5.5) .C0V2(25> 

DIMENSION  MEANS ( 5 ) , NAME ( 3 ) 

REAL  MEAN, MEAN1 .MEANS 

VARIABLE  ARRAYS 


COV  -  COVARIANCE  MATRICES  OF  ALL  NODES 

COV1  -  COVARIANCE  MATRIX  FOR  A  NODE  BEFORE  MASK 

C0V2  -  COVARIANCE  MATRIX  FOR  A  NODE  AFTER  MASK 

INDEX  -  NUMBER  OF  VECTORS  AT  EACH  NODE 

INP2  -  LABELS  OF  VECTORS  AT  A  NODE 

1TREE  -  TREE  STRUCTURE  READ  FROM  TREE  FILE 

LABL  -  THE  CLASS  LABELS  READ  FROM  TREE  FILE 

LABS  -  THE  CLASS  LABELS  READ  FROM  PRP  FILE  (NOT  USED) 

LL  -  NUMBER  OF  VECTORS  IN  A  CLASS  AT  A  NUDE 

MEAN  -  MEANS  OF  ALL  NODES  (REAL) 

MEAN1  -  MEAN  OF  A  NODE  BEFORE  MASK  (REAL) 

MEANS  -  MEAN  OF  A  NODE  AFTER  MASK  (REAL) 

NODE  -  THE  NODE  NUMBERS  OF  THE  TREE  (NOT  USED) 

WV  -  THE  MASKS  FOR  ALL  NODES  READ  FROM  TREE  FILE 
X  -  THE  VECTOR  BEING  CLASSIFIED  BEFORE  MASK 
XIN  -  THE  VECTORS  TO  BE  CLASSIFIED  READ  FROM  PRP  FILE 
XS  -  THE  VECTOR  BEING  CLASSIFIED  AFTER  MASK 

VARIABLES 


IANS  - 
INDX1 
INDX2 
IRGT  • 
KLAC  • 
KLAS  - 
LEFT  - 
LVEC  - 
MAX  - 
MAXN  ■ 
NELE  - 
NODEP 
NUMND 


USER  INPUT :  ANSWER 

-  INDEX  USED  IN  PACKING  VECTOR  VIA  THE  MASK 

-  INDEX  USED  IN  PACKING  VECTOR  VIA  THE  MASK 
THE  RIGHT  NODE  OF  NODEP 

THE  NUMBER  OF  CLASSES  (READ  FROM  PROPERTY  FILE) 
THE  NUMBER  OF  CLASSES  (READ  FROM  TREE  FILE) 

THE  LEFT  NODE  OF  NODEP 

THE  NUMBER  OF  VECTORS  IN  THE  PROPERTY  FILE 
THE  INDEX  NUMBER  OF  THE  DOMINANT  CLASS  OF  A  NODE 
THE  NUMBER  OF  VECTORS  IN  THE  DUHINANT  CLASS  OF  A 
NUMBER  OF  ELEMENTS  BEFORE  PACKING  (USING  MASK) 

-  THE  NODE  AT  WHICH  THE  VECTOR  IS  BEING  TESTED 

-  THE  NUMBER  OF  NODES  IN  THE  TREE  STRUCTURE 


CALL  RMPAR (LUOT ) 

CALL  ERLU(LUOT) 

OPEN  FILES  AND  READ  TREE  STRUCTURE 


n  *•  . 

& 


m- 


Oh* 

Vi*' 

«t‘H\ 

kk 


vs 

>  a>  *w  •  ■  „*•  A  A  A  /»  A  V  A  A*  .V  ’■  *  A  - 
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0039 
0060 
0061 
0*<*2 
L  ,3 
0064 
0065 
0066 
0067 
0060 
0069 
0070 
0071 
0072 
0073 
0074 
0073 
0076 
0077 
0078 
0079 
0080 
0001 
0002 
0003 
0084 
0003 
0084 
0087 
O'^S 
0.  9 
0090 
0091 
0092 
0093 
0094 
0093 
0096 
0  097 
0098 
0  099 
0100 
0101 
0102 
0103 
0104 
0105 
0106 
0107 
01  OB 
0109 
0110 
011  1 
0112 
01*1 
0  4 
0113 
0116 
0117 
01  18 


C 

774 

776 

770 

150 

250 


2 

3 

1 

C 

C 

c 

c 

c 


3 

99 

C 

C 

C 

C 

C 

C 

10 

c 

c 


URITE!LUOT ,774) 

FORMAT  <  *  ENTER  NAME  OF  TREE-FILE") 

READ!LUOT .776) INANE 
FORMAT (3A2) 

MR I TE! LUO T .778) 

FORMAT < "  ENTER  PROPERTY  FILE  TO  BE  CLASSIFIED") 
READ (LOOT .776) NAME 
WRITE(LUOT .150) 

FORMAT ( "  DO  YOU  UANT  A  HARD  COPY  7") 

READ (LUOT .230) IANS 
FORMAT! 1 A!) 

IF( IANS . EO . 1HY)  LUOT-30 
CALL  OPEN  (IDCB.1ERR  INANE) 

CALL  READF! IDCB . IERR .KLA8) 

CALL  READF < IDCB . IERR . NUHND ) 

CALL  READF< IDCB. IERR .LABL) 

CALL  READF (IDCB. I ERR.LL) 

CALL  READF! IDCB. IERR. ITREE) 

CALL  READF! IDCB. IERR. MV) 

CALL  READF ! IDCB , IERR . INDEX ) 

DO  1  N-l .NUHND 

CALL  READF ! IDCB . IERR . NODE(N) ) 

CALL  READF ! IDCB. I ERR .NELE) 

CALL  READF ! IDCB. IERR. MEAN1) 

CALL  READF! IDCB. IERR. COV1) 

DO  2  1-1 .NELE 
DO  2  J-l .NELE 

COV !  I .  j  .  N )  -COV 1 !  I .  J ) 

DO  3  1-1 .NELE 
MEAN! I . N)-MEAN1 ! X ) 

CONTINUE 

CALL  CLOSE! IDCB) 


INPUT  THE  VECTORS  TO  BE  CLASSIFIED 

CALL  0PENUDCB4.IERR  .NAME) 

CALL  READF! IDCB4. IERR .LUEC) 

CALL  REA0F!IDCB4. IERR.LELE) 

CALL  READF !IDCB4 .IERR .KLAC) 

CALL  READF! IDCB4. IERR .LABS) 

CALL  READF ! IDCB4 . lEfcR .  XIN) 

CALL  READF! IDCB4. IERR .INP2) 

CALL  CLOSE (IDCB4) 

DO  1223  IVEC-1 . LVEC 
DO  99  IP-1 .NELE 
X! IP)«XIN! IVEC . IP ) 

PRESENT  NODE  OPERATED  ON  IS  SET  TO  "1" 

NODEP-1 

THE  LEFT  AND  RIGHT  CHILD  OF  EACH  NUDE  IS  FOUND 

LEFT-ITREE! NQDEP . 1 ) 

IRCT- ITREE  <  NODEP  .2) 

SEE  IF  NODE  IS  TERMINAL 
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0119 
0120 
0121 
r  -*2 
L.23 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 
0134 
0135 
0136 
0137 
0138 
0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 
0  8 
0 1  49 
0150 
0151 
0152 
0153 
0154 
0155 
0156 
0157 
0158 
0159 
0160 
0161 
0162 
0163 
0164 
0165 
0166 
0167 
0168 
0169 
0170 
0171 
0172 
O'  3 
C  .  4 
0175 
0176 
0177 
0178 


C 

IF <  (LEFT.EO. 0) .OR . (IRCT.E0. 0)  )  GOTO  200 
C 

C  FIND  DISTANCE  FOR  LEFT  AND  RIGHT  NODES 

C 

DO  50  I-l .NELE 
DO  50  J-l.NELt 

50  COV1 < I . J )-COV( I . J .LEFT ) 

DO  55  1*1 .NELE 
55  NEAN1 (I)-HEAN(I .LEFT) 

INDX1-0 

INDX2-0 

DO  20  ITEL-l.NELE 
IF! MV! I TEL .NODEP ) . NE . 0 . 0  >  THEN 
INDX1-1NDX1M 
MEANS ( INDX1 ) -MEAN1 ( I  TEL > 

XSdNDXl)-X(ITEL) 

ENDIF 

DO  20  JTEL-1.NELE 

IF  <  UV < I TEL . NODEP  > . NE . 0 . AND . MV ( JTEL . NODEP  > . NE . 0 )  THEN 
INDX2-INDX2+ 1 

COV2 < I NDX2 ) -COV 1 < I TEL . J TEL > 

ENDIF 

20  CONTINUE 

RLFT«INDEX(ITREE< NODEP . 1 ) ) 

ALL V- INDEX (NODEP) 

CALL  MOST <C0V2. NELE. MEANS. XS.U1 . INDX1 . NELE»»2 . ALLV . RLFT ) 

DO  60  I-l .NELE 
DO  60  J-l  .NELE 

60  COV1(I.J)-COV<I.J.IRCT> 

DO  65  I-l .NELE 
65  MEANt  < I ) -MEAN! I . IRGT > 

INDX1-0 

INDX2-0 

DO  2000  ITEL-1 .NELE 
IF<MV(ITEL. NODEP) . NE . 0 )  THEN 
INDX1-1NDX1+1 
MEANS! INDX1 )*MEAN1 < I  TEL) 

X8< 1NDX1 >-X< I TEL) 

ENDIF 

DO  2000  JTEL-l.NELE 

IF <MV( I  TEL. NODEP) . NE . 0 . AND . «V< JTEL . NODEP ) . NE . 0 >  THEN 
INDX2*INDX2+1 

COV2( INDX2)-C0V1 < ITEL. JTEL) 

ENDIF 

2000  CONTINUE 
C 

RRGT> INDEX! I  TREE (NODEP .2) ) 

CALL  MOST (COW2 .NELE  MEANS  XS , M2 . INDX1 . NELE»«2 . ALLW . RRCT ) 

C 

C  MAKE  THE  DECISION  OF  MHETHER  TO  MAKE  TO  LEFT  OR  RIGHT  NODE  THE 

C  PRESENT  NODE 

C 

IF< Ml . LE . M2>  THEN 
NODEP -LEFT 
ELSE 

NODEP-IRCT 
ENDIF 
C 

c 


CLEAR  COM2  AND  MEANS 
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0179 

C 

0180 

DO  9000  1*1 .NELE»»2 

0181 

9000 

C0V2! I )*0 .0 

(  12 

DO  9001  1*1 .NELE 

0183 

9001 

MEANS! X) *0.0 

0184 

C 

0183 

C 

CO  TO  NEXT  NODE 

0186 

C 

0187 

GOTO  10 

0188 

C 

0189 

C 

WRITE  THE  FINAL  DESTINATION  OF  THE  VECTOR 

0190 

c 

0191 

200 

MAX-1 

0192 

MAXN-LL ( 1 . NODEP  ) 

0193 

DO  220  X-l.KLAS 

0194 

IF  ( MAXN . LT . LL < I . NODEP ) )  THEN 

0193 

MAX- 1 

0196 

MAXN-LL! I .NODEP) 

0197 

END  IF 

0198 

220 

CONTINUE 

0199 

WRITE! LUOT .101)  LABL! MAX) .NODEP 

0200 

101 

FORMAT!"  CLASS:  \.  1 A2 .3X ,  13) 

0201 

C 

0202 

C 

REPEAT  BY  ASKING  FOR  ANOTHER  VECTOR 

0203 

C 

0204 

1223 

CONTINUE 

0205 

C 

0206 

STOP 

0717 

END 

o  a 

c 

0209 

SUBROUTINE  MDST(COV .NELE .MEAN . X .W . INDXI . NELEO.ALLV 

0210 

c 

0211 

c 

THIS  IMPLEMENTS  THE  MINIMUM  DISTANCE  CLASSIFIER 

0212 

c 

0213 

DIMENSION  COV!NELEQ) .MEAN! NELE) . X!NELE> 

0214 

DIMENSION  R!3) . TR!5) . R1 !5) 

0213 

DIMENSION  L!3) .M!5) 

0216 

REAL  MEAN 

0217 

c 

0218 

c 

0219 

CALL  INV!COV . INDX1 .DET.L.M) 

0220 

CALL  SUMAT !X .MEAN .R . INDX1  . 1  ) 

u221 

CALL  TRMAT ( R . TR . INDX1 ,1.0) 

0222 

CALL  PRMAT (TR .COV.Rl ,1 .INDX1 .INDXI > 

0223 

CALL  PRMAT!R1 .R ,W.l .INDXI . 1 > 

0224 

c 

0223 

c 

CALCULATE  PROBABLITY 

3226 

c 

0227 

PROB-RORL/ALLV 

0228 

IF ( DET . NE . 0 . 0  >  THEN 

0229 

W*PROB-!ALOG!ABS(DET) ) >/2 . 0-W/2 . 0 

0230 

W— W 

0231 

ELSE 

0232 

W-PROB+20E20/2  0-W/2. 0 

0  1 

W— W 

0234 

ENDIF 

0233 

RETURN 

0236 

END 

0237 

END* 

r 


Appendix  B.  (Con’t) 


TREEF  T*0  0  0  04  IS  ON  CRQ0024  USING  00072  BLKS  R-0000 


FTN4X 


PROCRAM  TREEF 


THIS  PROGRAM  CREATES  A  BINARY  TREE  STRUCTURE  AND 
STORES  XT  ON  DISK.  AT  EACH  NODE  A  K-HEAN8  ALGORITHM 
IS  CALLED  TO  SEPARATE  THE  VECTORS  INTO  TWO  NEW  NODES. 

THIS  PROGRAM  READS  IN  A  PROPERTY  FILE  CREATED  BY  PROGRAM 
' MKPRP ' .  THE  FORMAT  OF  THIS  FILE  IS: 


VARIABLE 


SIZE 


DESCRIPTION 


NVEC 


•  OF  VECTORS 


NELE 


•  OF  ELEMENTS 


KLAS 


LABL 


KLAS 


NVEC  X  NELE  X  2 


NVEC 


•  OF  CLASSES 
NAMES  OF  CLASSES 
VECTORS 

VECTOR  LABELS  /  THEY 
IN  THE  SAME  ORDER  AS 
THE  VECTORS  IN  XX. 


VARIABLE  ARRAYS 


Cl  —  WORK  ARRAY  FOR  MVCOV 

C2  —  WORK  ARRAY  FOR  MVCOV 

COV  -  COVARIANCE  MATRIX  IN  MVCOV 

INI  -  LABEL  BUFFER  FOR  LEFT  NODE 

IN2  -  LABEL  BUFFER  FOR  RIGHT  NODE 

INB  -  LABEL  BUFFER  FOR  INPUT  TO  CLUS 

INDEX  -  THE  NUMBER  QF  VECTORS  AT  EACH  NODE 

IREC  -  STORAGE  FOR  RECORD  THAT  CORRESPONDS  TO  A  NUDE  NUMBER 

ITREE  -  THE  TREE  STRUCTURE 

LABL  -  THE  NAMES  (LABELS)  OF  THE  CLASSES 

LL  —  THE  NUMBER  OF  VECTORS  IN  EACH  CLASS  AT  EACH  NUDE 

MEAN  -  THE  MEAN  CALCULATED  IN  MVCOV.  THIS  IS  A  REAL  ARRAY. 

MEAN1  *  THE  TWO  MEANS  CALCULATED  FROM  CLUS.  THIS  IS  A  REAL  ARRAY. 
OLDM  —  THE  OLD  MEAN  VALUES  IN  CLUS 
W  —  THE  MASKS  FOR  ALL  NUDES 

WOLD  —  THE  PREVIOUS  MASK  VALUE  WHEN  IN  "SO"  ROUTINE 

WORTH  -  STORES  HOW  WELL  SEPARATED  EACH  NODE  IS 

WPRIM  -  THE  BEST  MASK  IS  STORED  HERE  AFTER  "SO*  ROUTINE 

XI  —  THE  VECTOR  BUFFER  FOR  THE  LEFT  NODE 

X2  --  THE  VECTOR  BUFFER  FOR  THE  RTGHT  NODE 

XX  —  THE  VECTOR  BUFFER  FOR  THE  NODE  BEING  CLUSTERED 


VARIABLES 
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3059 
0060 
3061 
C  2 
1063 
0064 
0063 
0066 
3  067 
>068 
0069 

3070 

3071 
1072 
)  073 
3074 
0075 
3076 
0077 
3  078 
3079 
0080 
3081 
0082 
3  083 
0084 
3  085 
0  086 
30*7 
0  3 
3089 
0090 
1091 
0092 
3093 
0094 
0095 
0096 
3  097 
0098 
0  099 
0100 

3101 

3102 
0103 
3104 
0105 

3106 

3107 

3108 

3109 

3110 

3111 
0112 
) ' 

o  1 1 4 

3115 

3116 
i  1 1 7 
1118 


C 

C 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


c 

c 

c 


c 

c 

c 


c 

c 

c 


c 

c 

c 

c 


c 

c 

3 

c 

c 

c 


333 


335 


ICMND 

IND1 

IN02 

KLAS 

KNODE 

KREC 

N1  — 

N2  — 

NELE 

NODE 

NVEC 

OLDVL 

PCLAS 

SHSH1 

SMSH2 

VALUE 

VIN1 

VIN2 

WHIN 


-  INPUT  FROM  USER  IN  COMMAND  LOOP 

-  NUMBER  OF  VECTORS  IN  LEFT  NODE 

-  NUMBER  OF  VECTORS  IN  RIGHT  NODE 
■  TOTAL  NUMBER  OF  CLASSES 

-  MAXIMUM  NUMBER  OF  NODES  ALLOWED 

-  PRESENT  NUMBER  OF  RECORDS 
LEFT  NODE  NUMBER 
RIGHT  NODE  NUMBER 

-  MAXIMUM  NUMBER  OF  ELEMENTS 

-  NUMBER  OF  NODE  TO  BE  CLUSTERED  OR  OPERATED 

-  NUMBER  OF  VECTORS  IN  PROPERTY  FILE 

-  THE  OLD  VALUE  OF  THE  VARIABLE  "VALUE" 

-  NUMBER  OF  VECTORS  IN  DOMINANT  CLASS  AT 

-  INTERMEDIATE  VARIABLE  FOR  CALCULATING 

-  INTERMEDIATE  VARIABLE  FOR  CALCULATING 

-  MEASURE  OF  THE  CLUSTER.  LARGE  VALUES  ARE  GOOD. 

•  INTERMEDIATE  VARIABLE  FOR  CALCULATING  "VALUE" 

-  INTERMEDIATE  VARIABLE  FOR  CALCULATING  "VALUE" 

-  THE  MINIMUM  VALUE  OF  WORTH (NODE) .  THIS  IS  THE  BEST  VALUE. 


ON 


A  NODE 
'VALUE" 
'VALUE" 


DIMENSION 

DIMENSION 

DIMENSION 

DIMENSION 

DIMENSION 


LU0T(5) .XX (400 .  5) . OLDM(  2.  5>.C1(  5.  5) 

INK400)  ,C2(5.5)  .C0V(5,5)  .MEAN1  (2.5)  .IN2(400) 
LABL(4) . INAME(3) . MEAN(5) ,LL(4.61 ) .INB(400) ,W(5.61 ) 
XI (400.5) .X2(4Q0 .5) .ITREE(61 .2) .INDEX (61 3 .UPRIM(5) 
INAM1 (3) > WOLD ( 5 )  , WORTH (61 ) 

REAL  MEAN .MEAN 1 

DIMENSION  IDCB( 144) . IDCB1 (144) . IDCB2( 1 44) , IREC ( 1 00 ) 


SET  OLD  VALUE  TO  0 


OLDVL-O.O 

SET  LEFT.  RIGHT.  AND  STARTING  NODE  TO  INITIAL  VALUES 


NODE-1 

Nl-0 

N2-1 


SET  MAX.  NODES 
KNODE-61 


INIT .  LU 


CALL  RMPAR(LUOT) 
LU-LUOT ( 1 ) 

CALL  ERLU(LU) 


FORMAT (1A2) 
READ  FILES 


WRITE (LUOT .333) 

FORMAT ( "  ENTER  NAME  OF  PROPERTY  FILE") 

READ(LUOT .3343INAME 
WRITE (LUOT. 335) 

FORMAT ( "  ENTER  THE  FILE-NAME  FOR  THE  TREE  STRUCTURE") 
READ ( LUOT . 334 ) INAM1 
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m 


m 
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3119  334 

3120 
0"M 
3._2 
3123 
J124 
0125 
3126 
0127 

3128  C 

3129  C 
0130  C 
<3131 
0132 
3133 
0134 
3135 
0136  C 
3137  C 
0138  C 
3139 
0140  C 
3141  C 
0142  C 
3143 

0144  1192 

3145  C 

3146  C 

0  7  C 

3  i  .8 
0149  C 
3150  C 
0151  C 
3152 
0153  C 
0154  C 
0155  C 
0156 
0157 
0158 
0159  C 
0160  C 
0161  C 
0162  C 
0163  C 
3164  C 
0165  C 
3166  C 

J 1 67  96 

3168  250 

0169 
3170 


FORMAT (3A2) 

CALL  OPEN( IDCB . IERR . INAME ) 

CALL  READF  < IDCB  ,  IERR . NVEC 3 
CALL  READF! IDCB. IERR .NELE) 

CALL  READF < IDCB. IERR .KLAS) 

CALL  READF( IDCB . IERR ,LABL) 

CALL  READF (IDCB. IERR, XX) 

CALL  READFl IDCB. IERR, INI > 

CALL  CLOSE! IDCB) 

PURGE  OLD  AND  CREATE  NEW  SCRATCH  FILE 

CALL  PURGE(IDCB1 . IERR ,5HXBUFR ) 

CALL  PURGE( IDCB2 . IERR ,5HIBUFC ) 

CALL  CREAT(IDCB1 , IERR. 5HXBUFR ,500. 3. 0.-24) 

CALL  CREAT ( IDCB2 . IERR .5HIBUFC .500 ,3. 0 .-24) 

IF ( IERR . LT . 0 )  UR ITE (LUOT ,50 ) IERR 

SET  FIRST  INDEX 

INDEX ( NODE ) *NVEC 

FILL  LABEL  ARRAY  FOR  NODE  1 

DO  1192  1*1 .KLAS 
LL!I,1>-100 

STORE  VECTOR  NAMES  ON  SCRATCH  FILE 
CALL  WRITF(IDCB2, IERR, INI ,NVEC> 

INITIALIZE  RECORD  NUMBER 
KREC-1 

STORE  DATA  ON  SCRATCH  FILE 

CALL  URITFdDCBl  .IERR  .  XX  .  2*NVEC*NELE ) 

IF < IERR . LT  .  0 )  UR ITE (LUOT .50) IERR 
IREC(1 >*KREC 

ENTER  COMMAND  LOOP.  THE  POSSIBLE  COMMANDS  ARE  : 

LU  —  CHANGE  OUTPUT  DEVICE 
TR  —  TERMINATE  PROGRAM 

SO  —  SEQUENCE  W-MASK  THROUGH  ALL  COMBINATIONS  AND  CREATE  TREE 
CT  —  CREATE  TREE  USING  A  W-MASK  UF  ALL  1'3  (ALL  FEATURES  USED) 

UR ITE (LUOT .250 ) 

FORMAT!"  ENTER  COMMAND"./."??") 

READ<LU0T.3)ICMND 

IF( ICMND.EQ.2HS0)  GOTO  252 


3171 

r 

0.  3 

3174  3000 

0175 
0176 
0177 
0178  C 


IF ( ICMND . EQ . 2HCT )  THEN 
DO  3000  1*1 .NELE 
DO  3000  J*1  .KNODE 
Wd  ,  J)*l  .  0 
GOTO  252 
ENDIF 

IF (ICMND. E0.2HTR)  STOP 
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>179 

0180  256 

3  1 

0.d2 

1183  C 
J 1 84  C 
0185  C 
3186 
0187  C 

3188  C 

3189  C 

3190  252 

0191  4444 

>192 

3193  7721 

1194 

3195 

3196  5994 

3197 

3198 

3199  C 

3200  C 

3201  C 

3202 
0203 

3204  C 

3205  C 

0206  C 
3  7 

02u8 

3209  C 

3210 

0211  50 

3212  C 

3213 
0214 

3215  C 

3216  C 

0217  C 
3218 
(3219  C 
3220  C 
■221  C 
j222  770 

3  223 

;  224  C 
j225 
)226 
'227  C 
i228  C 
3229  C 
i230 
3231 

r  ?  c 
)2wJ  c 

3234  C 
<235 
1236 
J237 

3238  29 


IF ( ICMND . EO . 2HLU)  THEN 
READ(LU0T .*)LU 

IF  <  LU . LE . 0 .OR . LU . GE . 39 )  GOTO  256 
ENDIF 

IF  NO  RECOGNIZABLE  COMMAND  IS  ENTERED .  THEN  ASK  FOR  ANOTHER 
GOTO  96 

SPLIT  THE  NODE  SELECTED  INTO  TUO  NEW  NODES 
NODE-1 

IF< ICMND. E0.2HS0)  THEN 
DO  7721  1-1 .NELE 
U( I .NODE  3-0 . 0 

CALL  SEOUN ( MOLD . U . NODE .KNODE . NELE ) 

ENDIF 

FORMAT </.l OX. "W  MASK  «»>  *.5F2.0) 

Nl-Nl+2 

N2-N2+2 

STORE  LEFT  AND  RIGHT  CHILD  IN  ARRAY.  "ITREE" 

ITREE(NODE . 1 >«N1 
ITREE1NODE .2)*N2 


FIND  CORRECT  RECORDS 

CALL  POSNT ( IDCB1 . IERR . IREC ( NODE > .  1  > 

CALL  POSNT < IDCB2 . IERR , IREC < NODE) , 1 ) 

IF < IERR . LT . 0 )  WR ITE (LUQT . 50 ) IERR 
FORMAT ( "  FILE-ERROR  #-.I4) 

CALL  READFdDCBl  .IERR. XX) 

IF ( IERR . LT . 0 )  WRITE (LUOT ,50) IERR 

READ  LABELS  FOR  VECTORS  AT  THIS  NODE 

CALL  READF < IDCB2 , IERR . INB ) 

CLUSTER  THE  VECTORS  , 

CALL  CLUS ( XX . NELE , INDEX ( NODE ) .XI .IND1  . X2 . IND2 . NODE . N1 , N2 . MEAN  1 
*  .  OLDM  .KNODE NVEC ,  INB  .  INI  ,IN2,U) 

IF< ICMND. E0.2HNX)  WRITE(LU .5994) (W(K .NODE) .K»l .NELE) 

IF ( ICMND . EO . 2HCT)  WR ITE< LU. 5994 ) < W <K .NODE) ,K*1 .NELE) 

STORE  LEFT  AND  RIGHT  NODE  NAMES  IN  LIST 

INDEX(N1 )-INDl 
INDEX ( N2 ) “IND2 

PRINT  THE  RESULTS  OF  THE  CLUSTERING  AND  ASK  IF  IT  IS  TO  BE  STOKED 
DO  27  I-Nl . N2 

IF( ICMND. E0.2HNX)  UR ITE(LU . 29 ) I 
IF ( ICMND . EO . 2HCT )  WR ITE ( LU . 29 ) I 
F0RMAT(16X. "NODE  *" ,13) 
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)239 

3"*0 

3.  1 

1242 

)243 

0244 

)245 

0246 

)247 

)248 

)249 

30 

0250 

C 

)251 

J252 

27 

)253 

32 

3254 

C 

<255 

.1256 

C 

)257 

C 

3258 

C 

J259 

C 

1260 

3261 

1262 

0263 

3264 

0265 

)?  '  S 

32-/ 

0268 

3269 

722 

0270 

C 

0271 

0272 

0273 

0274 

C 

0275 

C 

0276 

c 

3277 

0278 

c 

3279 

c 

0280 

c 

3281 

c 

0282 

0283 

1284 

646 

0285 

0286 

0287 

C 

0288 

0289 

0290 

02'  ‘ 

105 

02  _ 

C 

0  293 

0294 

0  295 

0296 

0297 

0298 

991 

IF ( ICMND . EQ . 2HNX)  WRITE<LU.26>  (LABL(J) . J=1 .KLAS) 

IF ( ICMND . EO . 2HCT )  URITE(LU .26)  <LABL< J ) . J  =  1 .KLAS) 

DO  30  NUMK*1 .KLAS 
LLtNUMK .  I  )=0 
DO  30  J=1 . INDEX ( I ) 

IF(I.EO.Nl)  THEN 

IF< INI <  J ) . EO . LABL ( NUMK ) ) LL ( NUMK . I ) =LL C NUMK . I )  +  1 
ELSE 

IF ( IN2 ( J ) . EO. LABL (NUMK) >LL<NUMK .I)=LL(NUMK .1 >  +  l 
ENDIF 
CONTINUE 

IF( ICMND. EQ.2HNX)  URITE(LU . 32) (LL( INUMB . I > . INUMB-1 .KLAS) 

IF < ICMND. E0.2HCT)  WRITE<LU .32) <LL< INUMB. I ) , INUMB-1 .KLAS) 

FORMAT <12X.4< IX. .13) ./> 

IF ( ICMND. E0.2HS0)  THEN 

HERE  IS  THE  IMPLEMENTATION  OF  A  FORMULA  THAT  WILL 
TELL  HOW  GOOD  A  SEPARATION  IS 

SMSH1 =0 . 0 
SMSH2*0 . 0 
DO  7225  IJ*1.KLAS 
IF<LL(IJ.N1 ) .NE. 0)  THEN 

SMSH1 “FLOAT  <  LL ( I J . N1 > ) *ALQGT < FLOAT ( LL ( I J . N1 > ) >+SMSHl 
ENDIF 

IF(LL(IJ.N2) .NE. 0)  THEN 

SMSH2*FL0AT  <  LL  < I J . N2 ) ) *ALOGT (FLOAT ( LL  < I J . N2 ) > ) +SMSH2 
ENDIF 
CONTINUE 

VIN1 =FLOAT ( IND1 ) /FLOAT ( IND1 +IND2) 

VIN2=FLOAT< IND2 ) /FLOAT < IND1+IND2) 

VALUE=VIN1»SMSH1+VIN2»SMSH2 

SET  VALUE  TO  0  IF  IND1  OR  IND2  IS  0  OR  1 

IF< IND1 .EQ. 0 .OR. IND2.E0. 0 .OR. IND1 . EO . 1 . OR . IND2.E0. 1 )  VALUE-0 . 0 

IF  THIS  IS  THE  LARGEST  VALUE  YET.  THEN  SAVE  THE  CORRESPONDING 
W  MASK 

IF < VALUE. GT.OLDVL)  THEN 
DO  6461  ISU=1.NELE 
WPRIM< ISW)=W< ISW.NODE) 

OLDVL=VALUE 

ENDIF 

CALL  SEOUN< WOLD .W .NODE .KNODE ,NELE) 

I QUIT  * 1 

DO  1055  1  =  1  ,  NELE 

IF<W<I  .NODE) ,EO. 1 )  IQUIT  =  0 

IF  < IQUIT . EO . 1 )  THEN 
ICMND=2HNX 
VALUE-0 . 0 
OLDVL=0 , 0 
DO  9911  1  =  1  .NELE 
W<I ,NODE)=UPRIM<I) 
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1299  ENDIF 

)~'0  GOTO  770 

)w„l  ENDIF 

3302  C 

3303  C 

3304  C  STORE  INFORMATION  ON  FILE 
0305  C 

1306  C  MOVE  TO  EOF 
0307  C 

3308  CALL  POSNT ( IDCB1 .IERR.KREC+1,1) 

0309  CALL  POSNT (IDCB2 . IERR .KREC+1  .1) 

0310  C 

3311  C  INCREMENT  RECORD  NUMBER 
0312  C 

3313  KREC-KREC+1 

0314  C 

3315  C  STORE  RECORD  • 

0316  C 

3317  1REC(N1 3-KREC 

0318  C 

3319  IF(IERR.LT.O)  WRITE (LUOT ,50 ) IERR 

0320  C 

0321  C  WRITE  CLASS  ONE  VECTORS  TO  FILE  (THESE  ARE  FROM  THE  LEFT  NODE) 

0322  C 

0323  CALL  WRITF< IDCB1 . IERR .XI . 2*NVEC*NELE ) 

0324  CALL  WRITF < IDCB2 . IERR . INI .NVEC) 

0325  C 

0  >  IF(IERR.lT.0)WRITE(LUOT.50)|ERR 

03c7  C 

3328  C  INCREMENT  RECORD  * 

0329  C 

0330  KREC=KREC+1 

0331  C 

3332  C  STORE  RECORD  * 

0333  C 

3334  IREC(N2)*KREC 

0335  C 

0336  C  WRITE  CLASS  TWO  VECTORS  TO  FILE  (RIGHT  NODE  VECTORS) 

0337  C 

0338  CALL  WRITF( IDCB1 . IERR  .X2 . 2*NVEC*NELE) 

0339  CALL  UR ITF ( IDCB2 . IERR , IN2 , NVEC ) 

3340  C 

J341  IF(IERR.LT.O)  WR ITE(LUOT . 50 ) IERR 

0342  C 

)343  C  CALCULATE  THE  STATUS  OF  EACH  NODE 

1344  C 

3345  DO  882  I»1 .N2 

0346  WORTH( I  )»0 . 0 

3347  P INDX=INDEX ( I > 

3348  IF ( INDEX ( I ) . NE . 0 )  THEN 

3349  PCLAS-0.0 

>350  DO  881  J»1 .KLAS 

J?"-  881  IF ( PCLAS .LT.LL(J.I))  PCLAS-LL ( J . I > 

-  IF  ( PCLAS  .  NE  .  0  )  WORTH(  1 ) =PCLAS#ALOGT  (PCLAS/PINDX) 

'353  IF ( INDEX ( I ) -PCLAS . EQ . 1 )  WORTH(I)*0.0 

1354  ENDIF 

0355  882  CONTINUE 

>356  24  FORMAT (3X.20I3./) 

0357  26  FORMAT (13X,4(2X.1A23  ) 

>358  C 
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0359 

C 

1360 

C 

0  1 

C 

):>o2 

C 

0363 

0364 

C 

0365 

0366 

0367 

0368 

1369 

0370 

C 

0371 

)372 

9933 

0373 

1374 

0375 

4923 

0376 

C 

1377 

C 

0378 

C 

1379 

1380 

Sill 

0383 
0384 
0385 
0386 
o  f 
0388 
0389 

100 

0390 

C 

0391 

C 

0392 

0393 

0394 

C 

0395 

C 

0396 

C 

0397 

0398 

0  399 

C 

040  0 

C 

0401 

C 

0402 

0403 

c 

J404 

1405 

c 

1406 
>407 
0408 
)  409 
1410 

91 

j41  1 

1' 

6119 

>4.3 

6117 

1414 

)415 

>416 

>417 

6116 

THE  INITIAL  MINIMUM  VALUE  IS  SET  TO  THE  PROVIUUSLY  CREATED  LEFT 
NODE.  THIS  IS  DONE  TO  INSURE  THAT  A  NODE  WITH  ITREE  ROOTS  OF  0 
IS  SELECTED. (EC.  NO  NODES  ALREADY  DONE  CAN  BE  DONE  AGAIN.) 

UMIN»WORTH(Nl ) 

DO  9933  1-2. N2 

IF ( ITREE ( I . 1 ) . EO . 0 . AND . ITREE ( I . 2 ) . EO . 0 . AND . WORTH ( I ) . LE . UMIN) THEN 
UMIN»WORTH(I> 

NODE* I 
ENDIF 
CONTINUE 

IF ( ICMND . EO . 2HNX)  ICMND-2HS0 
FORMAT (15X. "THE  NEXT  NODE  IS”. 13./) 

IF<N2. LT.KNODE.AND.WMIN.NE. 0.6)  WRX7E(LU.4923>  NODE 
IF <N2 . LT . KNODE . AND . WMIN. NE . 0 . 0 )  GOTO  4444 

PURGE  OLD  FILES  AND  STORE  NEW  DATA  IN  THEM 

CALL  PURCE(  IDCB IERR  ,  INAM1 ) 

CALL  CHEAT ( IDCB , IERR . INAM1 .300,3.0,-24) 

CALL  WRITF (IDCB. IERR. KLAS. 1> 

CALL  WRITF (IDCB , IERR , N2 . 1  ) 

CALL  WRITF ( IDCB, IERR .LABL .KLAS) 

CALL  WRITF( IDCB. IERR .LL .KLAS»KNODE) 

CALL  WRITF( IDCB . IERR .ITREE . 2*KN0DE ) 

CALL  WR I TF ( IDCB . IERR . W . 2»KN0DE*NELE ) 

CALL  UR ITF ( IDCB .IERR . INDEX .KNODE > 

DO  91  NODE-1. N2 
INND-INDEX(NODE) 

POSITION  AND  READ  FILES 

CALL  POSNT ( IDCB1 . IERR . IREC ( NODE > .1) 

CALL  READFdDCBl  ,  IERR  ,XX) 

WRITE  NODE  *  AND  *  OF  FEATURES  AT  THAT  NODE 

CALL  URITFdDCB, IERR. NODE, 1) 

CALL  WRITF (IDCB , IERR ,  NELE , 1 ) 


CREATE  MEAN  VECTOR  AND  COVARIANCE  MATRIX 
CALL  MVCOV ( XX , MEAN , COV .Cl . C2 , NELE , INND . NVEC > 

CALL  WRITF ( IDCB . IERR . MEAN . NELE*2 ) 

CALL  WRITF ( IDCB , IERR .COV. <NELE«*2)»2> 

CALL  CLOSEdDCBl) 

CALL  CLOSE (IDCB2) 

CALL  CLOSEdDCB) 

WRITE(LU .61 19 ) (LABL (I ) . 1-1 ,KLAS ) 

FORMAT (57X.4(2X.l A2 ) ) 

DO  6117  1-1 . N2 

WR ITE (LU .6116)  I, (ITREE (I .J) .J-l ,2) . (LL(INUM.I) . INUM-1 .KLAS) 
FORMAT (1  OX . "NODE*" ,I3,5X. "LEFT--)" .13 ,5X . "RIGHT — > " . 13 .7X , 
*4( 1 X . 13) ) 

STOP 

END 
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SUBROUTINE  HVCOV 


0419 

0420 

'  T1 

li-,22 

0423 
0424 
0425 
0426 
0427 
0428 
0429 
0430 
0431 
0432 
0433 
0434 
0435 
0436 
0437 
0438 
0439 
0440 
0441 
0442 
0443 
0444 
0445 
0446 
0  7 
0*»-»8 
0449 
0450 
0451 
0452 
0453 
0454 
0455 
0456 
0457 
0458 
0459 
0460 
0461 
0462 
0463 
0  464 
0465 
0466 
0  467 
0468 
0  469 
0  470 
0471 
)/  ■» 
Jv  J 
0474 
0475 
0476 
0  477 
0  478 


C 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


100 


20 

10 


30 

40 

50 


60 

70 


80 

90 


C 

C 

C 

C 


C 

C 

C 

C 

C 

C 


A - MEAN  VECTOR 

NFC  -  DIMENSION  OF  VECTORS 

Cl .C2  -  BUFFERS 

C  -  COVARIANCE  MATRIX 

NDATA--  NUMBER  OF  DATA  POINTS 
X  -  INPUT  VECTORS 

SUBROUTINE  TO  COMPUTE  MEAN  VECTOR  AND  COVARIANCE  MATRIX 
BASED  ON  THE  INCOMING  TEXTURE  MEASUREMENT  VECTOR. X 

SUBROUTINE  MVCOVtX .A,C .Cl ,C2 .NFC .NDATA .NVEC) 

DIMENSION  C(NFC.NFC) ,C1 <NFC ,NFC ) , C2<NFC .NFC) ,A<  NFC ) 

DIMENSION  X< NVEC. NFC) 

DO  100  1-1  ,NFC 
DO  100  J-l ,NFC 
C1<I,J)>0.0 
C2< I . J)-0 . 0 
DO  10  I-l.NFC 
A( I )-0 . 0 
Z-NDATA 

DO  20  J-l .NDATA 
A( I )— A( I)+X(J,I) 

A( I )*A( I )/Z 

DO  50  1-1, NFC 

DO  40  J-l .NFC 

C(I.J)-0.0 

DO  30  K-l , NDATA 

Cl<I.J)-Cia.J)+X(K  ,I)»X<K,J) 

C1<I,J)-C1(I,J)/Z 
CONTINUE 
CONTINUE 
DO  70  1-1  .NFC 
DO  60  J-1 .NFC 
C2(I .J)-A<I)*A(J) 

CONTINUE 

CONTINUE 

DO  90  1-1 .NFC 

DO  80  J-l.NFC 

C(I  .J)-Cl<I.J)-C2<I .J) 

CONTINUE 

CONTINUE 

RETURN 

END 

THIS  SUBROUTINE  TAKES  A  GROUP  OF  VECTORS  AND  SEPARATES  THEM 
INTO  TWO  CLASSES.  THIS  IS  DONE  WITH  A  K-MEANS  ALGORITHM. 

SUBROUTINE  CLUS1X .NELE .NVEC ,X1 . IND1 . X2 , IND2 . NUDE . N1 . N2 .MEAN 
• , OLDM , K  NODE , NVE2 , 1 NB , I N 1 , 1 N2 . W > 

THIS  SUBROUTINE  TAKES  AN  INPUT  VECTOR  SET."  X  ".  AND 
CLASSIFIES  IT'S  MEMBERS  INTO  ONE  OF  TWO  REGIONS 
DEFINED  BY  "XI"  AND  *X2"  VIA  A  K-MEANS  ALG.  THIS  ROUTINE 
ALSO  RETURNS  THE  TWO  MEAN  VECTORS  IN  THE  ARRAY  "MEAN". 

DIMENSION  X(NVE2 .NELE) ,X1 < NVE2 , NELE ) ,X2<NVE2 .NELE) 

25 
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0479 

DIMENSION  MEAN ( 2 , NELE ) .QLDM(2 . NELE) . INK  NVE2 ) 

0480 

DIMENSION  INKNVE2)  . IN2(NVE2>  .  W ( NELE  .  K NODE  ) 

0  1 

REAL  MEAN 

0*a2 

C 

0483 

c 

INITIAL  VALUES  ARE  CHOSEN  FOR  THE  MEAN 

0484 

c 

0485 

DO  99  I-l  .2 

0486 

DO  99  J-l.NELE 

1487 

HEAN( I , J  >»0 . 0 

0488 

99 

OLDM( 1  .  J>-0 . 0 

0489 

DO  10  IELE-1, NELE 

0490 

MEANd  ,  IELE >-X(l  ,  IELE) 

0491 

10 

MEAN (2 . IELE )-X ( 2 , IELE) 

0492 

OK-O.O 

)493 

DO  11  1-1 .NELE 

J494 

11 

IF < MEAN ( 1 . I )»W< I , NODE) . NE . MEAN ( 2 . 1 )  •  W  ( I  . NUDE ) ) OK - 1 . 0 

0495 

c 

1496 

c 

IF  THE  MEANS  ARE  THE  SAME.  INCREMENT  THE  ELEMENTS  OF 

0497 

c 

FIRST  MEAN  BY  ONE. 

)498 

c 

)499 

IF(OK.EO.O.O)  THEN 

0500 

DO  1001  1-1 .NELE 

1501 

1001 

MEANd  .  I  ) -MEAN ( 1  . 1  > ♦  1  .0 

0502 

ENDIF 

1503 

c 

0504 

c 

)S05 

c 

INITIALIZE  INDICES  FOR  THE  TWO  NEW  CLASSES 

0506 

c 

)!  ' 

100 

IND1-0 

03u8 

IND2-0 

1509 

DO  20  J-l  NVEC 

0510 

SOI -0.0 

3511 

SO2-0 . 0 

0512 

DO  30  1*1. NELE 

1513 

SO 1 -SOI ♦<X<J.I)»U<I. NODE ) -MEAN < 1 , 1 )«W( I . NUDE >  >  **2 

0514 

30 

SQ2-SQ2+ <  X  <  J . I >  *W< I . NODE ) -MEAN <2 . 1 ) »W ( I . NODE  > ) «»2 

)315 

SOI -SORT (SOI ) 

0516 

S02-SORT (S02) 

0517 

IF  (  S02  LE.  SOI  )  GOTO  50 

0518 

IND1 — IND1 *1 

1319 

DO  40  1*1 .NELE 

0520 

INI < IND1 )«INB< J) 

jS21 

40 

XI ( IND1 .I)-X(J.I) 

0522 

GOTO  20 

•  523 

C 

'524 

C 

j523 

50 

IND2-IND2+1 

'526 

DO  60  1-1 .NELE 

'527 

IN2 < IND2 ) -INB<  J ) 

i528 

60 

X2( IND2 , 1 ) -X( J .1) 

'529 

C 

)530 

20 

CONTINUE 

0531 

C 

J  5 

c 

i5.j 

c 

THE  MEAN  IS  COPIED  FOR  COMPARISON 

1534 

c 

'535 

c 

1336 

1537 

1538 

5 

DO  3  IELE-1 .NELE 

OLDMd  ,  IELE)  -MEAN  ( 1  .IELE) 

0LDM(2, IELE) -MEAN (2 . IELE) 

DO  3  IELE-1 .NELE 

OLDMd  .  IELE  )  -MEAN  ( 1  .  IELE  ) 

0LDM(2, IELE) -MEAN <2 . IELE) 
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0539 

C 

3540 

C 

A  NEW  MEAN  IS  CREATED  FOR  THE  TWO  NEW  CLASSES 

0  1 

c 

It, -.2 

DO  14  K-l .2 

0543 

DO  14  1-1 , MELE 

3544 

14 

HEANOt  .13-0.0 

0545 

DO  IS  1-1 , XND1 

1546 

DO  15  J-l ,NCLE 

J547 

15 

MEANd  .  J3-X1  <  I .  J  3/FLOAT <  IND1  > ♦MEAN(  1  ,J) 

0548 

DO  16  1-1  .  IND2 

3549 

DO  16  J-l .NELE 

3550 

16 

MEAN (2 . J) -MEAN (2, J) +X2 ( I  ,  J  3 /FLOAT ( IND2  3 

0551 

C 

1552 

C 

IF  THE  LAST  ITERATION  IS  THE  SAME  AS  THE  PRESENT  VALUE 

0553 

C 

THEN  THE  SOLUTION  HA8  BEEN  REACHED  AND  THE  3UBR .  RETURNS 

3554 

C 

3555 

FLAC-0 . 0 

0556 

DO  45  K— 1 .2 

3557 

DO  45  1-1 .NELE 

3558 

45 

IF(OLDM(K . I  )*W( I .NODE 3 . NE . HEANOC . 1 )•«( I , NODE  3 3FLAC-1  0 

0559 

IF  (FLAC.E0.1 .03  GOTO  100 

0560 

RETURN 

0561 

END 

0562 

C 

0563 

C 

THIS  SUBROUTINE  DOES  A  BINARY  COUNT  OF  THE  H-MASK .  THE 

0564 

C 

SEQUENCE  IS:  00  (ON  INITIAL  ENTRY) 

0565 

C 

10 

0566 

C 

01 

0  ' 

c 

11 

05©8 

c 

00 

0569 

c 

FOR  A  TWO  BIT  MASK. 

057# 

c 

0571 

SUBROUTINE  SEQUN ( WOLD . U . NODE , KNODE . NELE 3 

0572 

DIMENSION  M( NELE. KNODE 3 , HOLD (NELE 3 

0573 

c 

0574 

c 

STORE  OLD  MASK 

0575 

c 

0576 

DO  10  1-1 .NELE 

0577 

10 

HOLD ( I 3-U( I .NODE) 

0578 

c 

0579 

IF ( WOLD ( 1 3 . EO . 1 . 0  3  THEN 

0580 

W( 1 .NODE  3-1  . 0 

3581 

else 

0582 

U(1 .NODE 3-0 . 0 

3583 

ENDIF 

3584 

c 

>585 

DO  20  J-2.NELE 

0586 

I TOC-1 

>587 

DO  30  1-1 ,J-1 

3588 

30 

IF ( WOLD( 1 3 . EQ ■ 0 3  ITOC-O 

1589 

IF ( ITOC . EO • 1 3  THEN 

>590 

IF(WOLD( J) .EQ. 0)  THEN 

0591 

W(J .NODE  3-1 

)r  1 

ELSE 

05.  J 

W( J .NODE  3*0 

>594 

ENDIF 

3595 

ENDIF 

>596 

20 

CONTINUE 

0597 

RETURN 

3598 

END 
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